SWORD - a highly efficient protein database search

نویسندگان

  • Robert Vaser
  • Dario Pavlovic
  • Mile Sikic
چکیده

MOTIVATION Protein database search is one of the fundamental problems in bioinformatics. For decades, it has been explored and solved using different exact and heuristic approaches. However, exponential growth of data in recent years has brought significant challenges in improving already existing algorithms. BLAST has been the most successful tool for protein database search, but is also becoming a bottleneck in many applications. Due to that, many different approaches have been developed to complement or replace it. In this article, we present SWORD, an efficient protein database search implementation that runs 8-16 times faster than BLAST in the sensitive mode and up to 68 times faster in the fast and less accurate mode. It is designed to be used in nearly all database search environments, but is especially suitable for large databases. Its sensitivity exceeds that of BLAST for majority of input datasets and provides guaranteed optimal alignments. AVAILABILITY AND IMPLEMENTATION Sword is freely available for download from https://github.com/rvaser/sword CONTACT [email protected] and [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Effective Path-aware Approach for Keyword Search over Data Graphs

Abstract—Keyword Search is known as a user-friendly alternative for structured languages to retrieve information from graph-structured data. Efficient retrieving of relevant answers to a keyword query and effective ranking of these answers according to their relevance are two main challenges in the keyword search over graph-structured data. In this paper, a novel scoring function is proposed, w...

متن کامل

A Recyclable Poly(ionic liquid)s Enzyme Reactor for Highly Efficient Protein Digestion

One of the most significant tasks for proteomic research and industrial applications, is the preparation of recyclable enzyme reactor. Herein, a novel recyclable enzyme reactor has been developed based on monodispersed spherical poly(quaternary ammonium ionic liquid)s particles immobilized trypsin. A new quaternary ammonium ionic liquids functional monomer was first synthesized. The ionic l...

متن کامل

Highly Efficient Transfection of Dendritic Cells Derived from Esophageal Squamous Cell Carcinoma Patient: Optimization with Green Fluorescent Protein and Validation with Tumor RNA as a Tool for Immuno-genetherapy

This study was conducted to optimize a highly efficient mRNA transfection into dendritic cells (DC) derived from esophageal squamous cell carcinoma (ESCC) patients. Applying an electroporation technique, in vitro synthesized Green Fluorescent Protein (GFP) mRNA was transfected as an indicator into the DCs derived from a healthy donor. Flow cytometry revealed 84.9% transfection efficiency for DC...

متن کامل

Geometric Suffix Tree: A New Index Structure for Protein 3-D Structures

Protein structure analysis is one of the most important research issues in the post-genomic era, and faster and more accurate query data structures for such 3-D structures are highly desired for research on proteins. This paper proposes a new data structure for indexing protein 3-D structures. For strings, there are many efficient indexing structures such as suffix trees, but it has been consid...

متن کامل

A Query-oriented XML Fragment Search Approach on A Relational Database System

In this paper, we propose a query-oriented fragment search approach for efficient and effective XML search engines on relational database systems. Conventional approaches for XML fragment search have only considered statistics of XML documents stored in a database system to rank a search result. As a result, almost all XML fragment search engines have attained negative results in the research f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 32 17  شماره 

صفحات  -

تاریخ انتشار 2016